Universal Uncertainty Relations 
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Uncertainty relations are a distinctive characteristic of quantum theory that impose intrinsic lim- 
itations on the precision with which physical properties can be simultaneously determined. The 
modern work on uncertainty relations employs entropic measures to quantify the lack of knowledge 
associated with measuring non-commuting observables. However, there is no fundamental reason 
for using entropies as quantifiers; any functional relation that characterizes the uncertainty of the 
measurement outcomes defines an uncertainty relation. Starting from a very reasonable assumption 
of invariance under mere relabelling of the measurement outcomes, we show that Schur-concave 
functions are the most general uncertainty quantifiers. We then discover a novel fine-grained un- 
certainty relation that is given in terms of the majorization order between two probability vectors. 
Such a vector-type uncertainty relation generates an infinite family of distinct scalar uncertainty 
relations via the application of arbitrary uncertainty quantifiers. Our relation is therefore universal 
and captures the essence of uncertainty in quantum theory. 



Uncertainty relations lie at the core of quantum me- 
chanics and are a direct manifestation of the non- 
commutative structure of the theory. In contrast to clas- 
sical physics, where in principle any observable can be 
measured with arbitrary precision, quantum mechanics 
introduces severe restrictions on the allowed measure- 
ment results of two or more non-commuting observables. 
Uncertainty relations are not a manifestation of the ex- 
perimentalists' (in)ability of performing precise measure- 
ments, but are inherently determined by the incompati- 
bility of the measured observables. 

The first formulation of the uncertainty principle was 
provided by Heisenberg PQ, who noted that more knowl- 
edge about the position of a single quantum particle im- 
plies less certainty about its momentum and vice-versa. 
He expressed the principle in terms of standard devia- 
tions of the momentum and position operators 



AXAP^-. (1) 

/ 2 w 

Robertson [2^ generalized Heisenberg's uncertainty prin- 
ciple to any two arbitrary observables A and B as 



AA-AB>-\(i>\[A,B]\i>)\. 



(2) 



A major drawback of Robertson's uncertainty princi- 
ple is that it depends on the state \tp) of the system. 
In particular, when \tp) belongs to the null-space of the 
commutator [A, B] , the right upper bound becomes triv- 
ially zero. Deutsch [3] addressed this problem by pro- 
viding an entropic uncertainty relation (EUR) in terms 



of the Shannon entropies of any two non-degenerate ob- 
servables, later improved by Maassen and Uffink j4] to 



H(A) + H(B) ^ -2logc{A,B). 



(3) 



'Electronic address 
t Electronic address 
* Electronic address 



fricdlan@uic.edu 



vgheorgh@ucalgary.ca 
gour@ucalgary.ca 



Here H(A) is the Shannon entropy [5] of the probabil- 
ity distribution induced by measuring the state \ip) of 
the system in the eigenbasis {|fflj)} of the oservable A 
(and similarly for B). The bound on the right hand 
side c(A, B) := max mjn |(a m |6 n )| represents the maxi- 
mum overlap between the bases elements, and is inde- 
pendent of the state \i/j). 

Recently the study of uncertainty relations intensified 
(see [6j for a recent survey), and as a result various im- 
portant applications have been discovered, ranging from 
security proofs for quantum cryptography [7HH], infor- 
mation locking [9j , non-locality [10] , and the separability 
problem [TT]. There were also recent attempts to gen- 
eralize uncertainty relations to more than two observ- 
ables. For this case relatively little is known [T^l - fTB] . as 
the authors investigated only particular instances of the 
problem such as mutually unbiased bases. 

In most of the recent work on uncertainty relations, 
entropy functions like the Shannon and Renyi entropies 
are used to quantify uncertainty. Such entropies are used 
in information theory to quantify asymptotic rates of cer- 
tain information processing tasks 5 , in which the con- 
cepts of typical sets play a crucial role. However, in the 
context of the uncertainty principle where a single physi- 
cal system is involved, these entropies are not necessarily 
the most adequate to use. Indeed, as we show here, other 
functions can be more suitable in providing a quantita- 
tive description for the uncertainty principle. 

Uncertainty is related to the "spread" of a probability 
distribution, or, equivalcntly, to the ability of learning 
that probability distribution. Intuitively a less spread 
distribution is more certain than a more widely spread. 
For example, in a d-dimensional sample space, the prob- 




FIG. 1: With probability r Alice samples from a random vari- 
able (blue dice), and with probability 1— r, Alice samples from 
its relabeling (red dice), but at the end of the protocol she 
"forgets" where she sampled from. The resulting probability 
distribution rp + (1 — r)np is more uncertain than the initial 
one associated with the blue (red) dice p (np). Color online. 



ability distribution p — (1,0, ... ,0) is the most certain, 
whereas the distribution q = (1/d, 1/d, . . . , 1/d) is the 
most uncertain. What are then the minimum require- 
ments that a good measure of uncertainty has to satisfy? 

In his seminal paper [3] on EURs, Deutsch pointed out 
that the standard deviation A can be increased by mere 
relabelling of the random variables associated with the 
measurements. He therefore concluded that the relation 
in (ll| can not be used as a quantitative description of the 
uncertainty principle. Following Deutsch observation, we 
assume here that the uncertainty about a random vari- 
able can not increase under a relabelling of its alphabet. 
We call this very reasonable presumption monotonicity 
under relabelling (MUR). This will be our only require- 
ment from a measure of uncertainty. 

There are several consequences of the MUR assump- 
tion. First, the uncertainty associated with a probability 
vector p can not be larger than the uncertainty associated 
with a relabelled version of it, irp, where 7r is some permu- 
tation matrix. In fact, both uncertainties are the same as 
permutations acting on a probability space are reversible. 
Second, the uncertainty can not decrease under random 
re-labelings, see Fig. [T] We therefore conclude that any 
reasonable measure of uncertainty is a function only of 
the probability vector, is invariant under permutations of 
its elements, and must be non decreasing under a random 
relabelling of its argument. 

We formulate the above requirements quantitatively 
using Birkhoff's theorem [T71 [TS], which states that 
the convex hull of permutation matrices is the class of 
doubly-stochastic matrices (their components are non- 
negative real numbers, and each row and column sums 
to 1). Birkhoff theorem thus implies that a probability 
vector q obtained from p by a random relabeling is more 
uncertain than the latter if and only if the two are related 
by a doubly-stochastic matrix, q = Dp, which is equiva- 
lent to q < p. The last equation is known as a majoriza- 
tion relation |19| and consists of a system of d inequali- 
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FIG. 2: A quantum state is measured using two orthonormal 
bases. We collect the induced joint probability distribution 
in a vector p <g> q and quantify its uncertainty in terms of 
a majorization relation, independently of the state p. Color 
online. 



ties 1 . The above discussion implies that any measure of 
uncertainty has to preserve the partial order induced by 
majorization. The class of functions that preserve this or- 
der are the Schur-concave functions. These are functions 
$ on a d-dimensional probability space, $ : M d — > M, 
for which Q(x) ^ <&(?/) whenever x -< y, Vx, y £ M. d . 
We therefore define a measure of uncertainty as being 
any non-negative Schur-concave function that takes the 
value zero on the vector x = (1,0,... ,0). The last re- 
quirement is not essential but is convenient as it ensures 
that the measure is non-negative definite. 

Our definition for a measure of uncertainty is very gen- 
eral and resulted solely from requiring monotonicity un- 
der relabellings; it also encompasses the most common 
entropy functions used in information theory, but it is 
not restricted to them. As we are not concerned with 
asymptotic regimes, we use in the following the most 
general $ to quantify uncertainty, without making any 
assumptions about its functional form. 

Having defined what a measure of uncertainty is, we 
now use it to study uncertainty relations. Let p be a 
mixed state on a d-dimensional Hilbert space H = C d . 
For simplicity of the exposition, we first consider two 
basis (projective) measurements. We denote the two or- 
thonormal bases of W. by {\a m )} m =i an d {|&n)}n=i- We 
also denote by p m (p) = {a m \p\a m ) and q n (p) = (b n \p\b n ) 
the two probability distributions obtained by measuring 
p with respect to these bases. We collect the numbers 
Pm(p) and q n (p) into two probability vectors p{p) and 
q(p), respectively. The goal of our work is to bound the 
uncertainty about p(p) and q(p) by a quantity that de- 
pends only on the bases elements but not on the state 
p. The object of our investigation is therefore the joint 
probability distribution p(p) <£) q(p). 



A vector x £ R is majorized by a vector y £ 



and write 



y 



J- for all 1 < k € d - 1, 



k 

The down-arrow notation denotes 



x ~< y, whenever y] ■_] j;t ^ y] 

with Y.U 4 = T.Uv] 

that the component of the corresponding vector are ordered in 
decreasing order, x| ^ x^ ^ ■ ■ ■ JS x. d - 



The main result of our article is an uncertainty relation 
of the form 



p(p) ® q(p) -< W, Vp, 



(4) 



where u> is some vector independent of p that we explic- 
itly calculate. We call J4j a universal uncertainty relation 
(UUR) as, for any measure of uncertainty $, 



*(p(/9)<8>g(p)) >«(&>), Vp. 



(5) 



The UUR B generates in fact an infinite family of uncer- 
tainty relations of the form pi, one for each <I>. The right 
hand side of ([5]) provides a single-number lower bound on 
the uncertainty of the joint measurement results. When- 
ever $ is additive under tensor products (e.g. Renyi en- 
tropies), §5§ splits as 



$0pG>)) + *(«(p)) >«(«). 



(6) 



A vector u) op is optimal for the UUR Q whenever 
u; op -<! u) for all w that satisfy Q. We now show, by 
construction, that an optimal u) op exists. Let I k C [d] x 
[d] be a subset of k distinct pair of indices (m,n), where 
[d] is the set of natural numbers ranging from 1 to d. Let 



ilfc := max max 
ik p 



Y^ Pm(p)q n (p), 



(7) 



(m,ra)Glfc 



where the outer maximum is over all subsets Ik with 
cardinality k and the inner maximum can be taken only 
over pure states. Then the optimal d 2 -dimensional vector 
u) op for the UUR Q is given by 



u op = (^1,^2 -Qi,...,Qd-tod-i,0, ••-,0). 



(8) 



The components of u) op are ordered in decreasing order 
ill ^ fl 2 — £li ^ Q3 — tl 2 ^ ■ ■ ■ ^ &d — Sld-i- Moreover, 
ild = 1. We defer the proofs to the Appendix. 

The quantities £!& in nfh can be in general difficult 
to calculate explicitly, as they involve an optimization 
problem. However, in the Appendix we show that the 
first two elements can be computed explicitly as 



^li = \[l + c] 2 



Sh = \[l + d? 



(9) 



where c := max m> „ |(a m |6„)|, and 



c := max 



V\(a m \b n )\ 2 + \(a m ,\b n ,}\< 



where the maximum is taken over all indexes m = m! 
and n 7^ n' , and over all indexes n = n' and m =/= ml ' . 
For k > 2, we upper bound each fi^ in (l7| by 



fl k ■= max max ^ p m (p) ^ ^P) 
\n\+\s\=k+i ' \meiz / Ws / 



1 

- max 

4 K,S 

\n\+\s\=k+i 



y^ \a m )(a m \ + ^ \ b n){bn 



men 



nes 



(10) 
(11) 



where TZ (S) are subsets of distinct indices from [d], \1Z\ 
(\S\) denotes the size (number of elements) of 1Z (\S\), 
and || • I joo denotes the infinity operator norm - the max- 
imum eigenvalue of its argument. Moreover, fid = 1. 
Note that fl k = ttk for k — 1, 2, and otherwise flk < &k 
since for a fixed p, all the terms in the sum of ffh are 
strictly contained in the expression (10). The equality 



in (11) is non-trivial and follows from the main techni- 



cal Theorem of this article (proved in the Appendix): 
max p Tr(p^)Tr(pi?) = \\\A + B||^,, for two projections 
A and B. 

Similar to the definition of the optimal vector u) op in 
([8]), we construct the vector u> as in ^ by replacing flk 
with fifc. Since flk and flk are the same for k = 1,2, 
the first two terms of u3 op and u> are the same. A simple 
calculation (see Appendix) shows that u) op -< w, which 
yields 



p(p) ® g(p) -< ^c 



-<w, 



Vp. 



(12) 



Therefore u> provides a weaker lower-bound for the UUR 
(HI), trading optimality for simplicity of calculation. 

To appreciate the generality of our UUR Q , we com- 
pare in Fig.[3]the best known lower bounds for the uncer- 
tainty of the measurement in two bases with our induced 
uncertainty relation ^, in which we take <fr to be the 
Shannon entropy H. We consider the region in which 
c > 0.83, for which the best known bound 20 has an 
explicit analytical form. We note that our bound over- 
performs [SO] in a large number of instances (around 90% 
of the time). For c < 0.83, our bound tend to be slightly 
worse than |20j , but this is expected since our uncertainty 
relation is valid for all measures of uncertainty and is not 
optimized for a specific one such as Shannon's. Next we 
take $ = Hoc in (TSl) and note that we recover Maassen's 
and Uffink bound~4 for the minimum entropy, which is 
tight. Finally, choosing $ = H a (Renyi-a entropy) in (5J) 
provides yet a novel cntropic uncertainty relation valid 
for all values of the parameter a. 

We now extend our results to the most general case 
of L > 2 positive operator valued measures (POVMs). 
Denote by {II$}^ =1 the f-th POVM, with 1 < £ < 
L. The quantity Ng denotes the number of elements in 
the Z-th POVM, and the index ctg labels its elements, 
with ag = 1,2, ..., Ng. A measurement of p with the l- 
th POVM II ^ induces a probability distribution vector 
p W(p) = (p[ e \p),p ( 2 e \p),...,p^](p)). We discover a UUR 
of the form 



)P W (P)^U>, 



Op; 



Vp, 



(13) 



where the quantity on the left hand side represents the 
joint probability distribution induced by measuring p 
with each POVM II W . Here 



<^op — (^1, f^2 — Oi, O3 



n 2 



,fi 



N 



^v-i), (14) 
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FIG. 3: For each run we randomly generate two orthonormal 
bases and a random state \ip) in a 6-dimensional Hilbert space, 
then compute our lower bound H(ui) (right hand side of 15b, 
using the Shannon entropy If as a measure of uncertainty); 
c = max m: „ |(a m |6„}| denotes the maximum overlap between 
the two bases. Color online. 



where N = N X N 2 ■ ■ ■ N L , and for k = 1, 2, ..., N 



Qk '■= max max 

ik p 



(a ly ...,a L )eXk 

(15) 
where X& C [iVi] x [N 2 ] x • • • x [Nl] is a subset of k 
distinct string of indices (ai, ..., a/,) (here [Nj] is the set 
of natural numbers ranging from 1 to Nj ) . 

Since the above quantities S\. can in general be difficult 
to calculate explicitly, we have found tight upper bounds 
f2fc that do not involve an optimization over all states p. 
Our upper bound Q& < fi* is given by 



n k := 



max 
Si,-,Sl 

JZLi\Se\=L+k- 



e(e<) 

1=1 \atGSe / 



1 



<1, 



(16) 

where Si denotes a subset of distinct indices from [Ni], 
\Si\ denotes the size (number of elements) of Sg. Note 
that by definition il^ — 1. Defining u> as in (14) by 
replacing Q& with fl k gives u> op ~< u>. We can therefore 
use ijj instead of the optimal u) op in Eq. (13 1. 



Note that an L > 2 measurement uncertainty relation 
can be trivially generated by a summing pairwise two- 
measurement uncertainty relations, one for each pair of 
observables. Our UUR ( 13 ) is more powerful and is not 
of this form. This fact can be seen most clearly in a 
set of measurement operators in which any two observ- 
ables share a common eigenvector. In this case, a two- 
measurement uncertainty relation will provide a trivial 
lower bound of zero, hence the pairwise sum must also 
be zero. However, the vector ui in ( fl~4| is in general differ- 
ent from (1, 0, . . . . , 0), (see Example fl] in Appendix |B| , 
thus providing a non trivial bound for the UUR (13 1 (or 



the induced family obtained by applying various uncer- 
tainty measures <& on it). Finally, the UUR (13) is not 
restricted to MUBs or particular values of L, but is valid 
for any number of arbitrary bases. 

To summarize, we discovered two novel uncertainty re- 
lations, Q - valid for measurements in two orthonormal 
bases, and ( p~3| ) - the generalization of the former to the 
most general setting of L POVMs. Our relations are 
"fine-grained" ; they do not depend on a single number 
(such as the maximum overlap between bases elements), 
but on all components of vector u;, which we compute 
explicitly, via a majorization relation. Our uncertainty 
relations are universal and capture the essence of uncer- 
tainty in quantum mechanics, as they are not quantified 
by particular measures of uncertainty such as Shannon 
or Renyi entropies. 

We did not explore here which bases provide the most 
uncertain measurement results for the UURs. One may 
conjecture that MUBs are the suitable candidates. In- 
deed, this seems to be the case, and we conjecture that 
the optimal flk in ^ is given by f2& — |(1 + \/k/d) 2 , 
which can then be used to construct (see Q) the opti- 
mal w^l UB for the UUR Q. The conjecture is strongly 
supported by numerical simulations. Moreover, we ob- 
served that for bases that are not MUBs, the best uj we 
were able to find (numerically) always majorizes u)^fp B , 
i.e. u^p VB -< u>. This provides strong support for the ini- 
tial assumption that MUBs provide the most uncertain 
measurement outcomes. 
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Supplementary Information 
Universal Uncertainty Relations 

Here we derive and elaborate on the main results discussed in the paper. The Appendix consists of three parts. 
In Sec. [A] we prove the results for two orthonormal bases. In Sec. [B] we prove the results for L > 2 POVMs. In 
Sec. [O we discuss a possible generalization of our results to the case where each measurement is applied according to 
a pre-probability distribution; we call such relations weighted uncertainty relations. 

Appendix A: Two orthonormal bases 

We first show that the components of u) op in Eq. ([8]) of the main manuscript are ordered in decreasing order, 
17i ^ r?2 — fii ^ ^3 — ^2 ^ • • • ^ &d — f^d-i- Note that the first inequality in the chain, J7i ^ f22 — fii, is equivalent 
to fii + f2i ^ ri 2 - This latter inequality is always true by construction, see Eq. (17| of the main manuscript, since J7 2 
contains a sum with two terms, each of which cannot be greater than fii. The other inequalities can be written as 

fi fe + fi fc ^ fi fc _i + fifc+i, (Al) 



for all 1 < k < d. Observe now that both sides of ( Al ) consist of sums of 2k terms of the form given in Eq. Q of 



the main manuscript. The sum of the first k terms corresponding to fifc+i is no larger than fi^ (by construction). 
The sum of last remaining term from fi^+i and the fc — 1 terms from £lk-i is again no larger than fi^ (which was by 



construction the greatest possible value a sum of k terms can have), and this proves ( Al ). 

The fact that fi^ = 1 follows from the observation that, for orthonormal bases, one can always find a state p for 
which p — (1, 0, . . . , 0) (choose the state to be the projector onto one of the basis elements, e.g. p — \a\) {a\\). In this 
case, the expression in Eq. ¥i\ of the main manuscript for fi^ achieves its maximum possible value of 1. 

It now follows at once that u> op in Eq. ([8]) of the main manuscript is optimal for the UUR Q (main manuscript), 
since the sum of its first k largest components equals fifc > which, by construction, can not be smaller than the sum of 
the first k largest components of p(p) (8> q{p) in Q (main manuscript); moreover, u> op is normalized. 

We next prove the main Theorem, which we then use to compute the first two components fii and fi 2 in Eq. ^ 
of the main manuscript. 

Theorem 1. Let A and B be two orthogonal projectors, A = A 2 — A^ , B = B 2 — B^ , acting on C d . Then, 

maxTr( P A)Tr( P B) = ^WA + Bf^ < 1, (A2) 

p 4 

where the maximum is taken over all density matrices (i.e. positive-semidefinite matrices with trace 1) acting on C d , 
and || • | joo denotes the operator norm (the maximum singular value of its argument) . 

In order to prove Theorem [l] we first prove a weaker version of it in the following lemma. 

Lemma 1. Let \a){a\ and \b)(b\ be two rank-one projectors acting on C d . Then, 

max(a|p|a)(&|p|fo> = 1(1 + \(a\b)\) 2 = \ \\\a)(a\ + \b){b\\& , (A3) 

where the maximum is taken over all d x d density matrices p. 
Proof. First note that 

(a\p\a)(b\p\b) = (y/(a\p\a)(b\p\b)y < \ (Tr [p(\a)(a\ + \b)(b\)]f < \ \\\a){a\ + |6)(6|||^ , (A4) 

where the first inequality follows from the geometric-arithmetic mean inequality and the second inequality from the 
standard property of the operator norm. 

We next construct a state p for which the inequality above is saturated. We show that the maximum can be 
achieved with a rank one matrix, p — \x)(x\. Let |5) be a normalized vector such that the set {\a), \b)} is orthonormal 
and the vector \a) is in the span of {\a), \b)} (this can be done using the Gram-Schmidt process). We can therefore 
assume that also the optimal \x) is in the span of {\a}, \b)} and expand it as 

\x) = ff+r\b) + y/l-r 2 e w \a), (A5) 



with ^ r ^ 1 and 6,4> real numbers. We choose cj) such that e 1 ^(a|6) = |(c 
|(a|a)| = yl — c 2 . We then obtain 



c and take 9 so that e" 



a\p\a)(b\p\b) = |(a|x)| 2 |(6|x)| 2 = r 2 {rc+^/l - rVl - c 2 ) 2 . (A6) 

r and cos(/3) = c. Note that without loss of 



To optimize the function above over the variable r we denote cos(a) 
generality we can take < a, f3 < ir/2. Therefore, 

(a\p\a)(b\p\b) = cos 2 (a) [cos (a) cos(/3) + sin(a) sin(/3)] = cos 2 (a) cos 2 (c 



■p) = ^[cos(2a-/3)+cos(^)] 2 



Since a is determined by r, we can choose r such that a = /3/2. For this choice of r we get 

{a\p\a){b\p\b) = \{l + c)\ 



which proves the first equality in ( A3 1 . The second equality in ( A3 1 follows from direct calculation 



Proof of Theorem [T] Note first that 



maxTrO^Tr^B) < max - (Tr [p(A + B) 
p p 4 



f^-\\A- 



B\ 



<1, 



(A7) 

(A8) 

□ 

(A9) 



where the first inequality follows from the geometric-arithmetic mean inequality and the second one from Tr[/o(A + 
B)] < J2i=i ^i{p)^i(A + B) ^ ||A + _B|| 00 Tr(p) = HA + BHoo, where Xi(») denotes the j-th eigenvalue of its argument. 



The last inequality in (A9) is trivial and follows at once from the triangle inequality for the norm, \\A + B\ 
+ 1 1 -B Hoc = 2 (since A and B are projectors) 



It is therefore left to show that the maximum in (A2) can be achieved. In fact, we will show now that it can be 
achieved with a pure state p = |x)(x|; that is, 



^{x\A\x){x\B\x) = -\\A + B\\l ^l 

\x) 4 



(A10) 



To see it, let A'H and BW, be the subspaces onto which A and B project, respectively. Now, if AH^BH ^ {0} 
(i.e. the subspaces have a non-trivial intersection) then there exist some |xo) that belongs to both AH and BH, and 
therefore ^4|xo) = |xo) and S|xo) = |xo). Hence, in this case, \\A + B||oo = 2 and the maximum in ( A10[ ) is achieved 
for |x) = |xo). We therefore assume in the rest of the proof that AH f] BH = {0}. 

Since A and B are projectors, UA+UHoo ^ 1. We first prove the theorem for the simpler case in which || A+-B||oo = 1. 
In this case AB = BA = 0. To see it note that V|y) G AH, (y\A + B\y) = 1 + (y\B\y) = 1 (since the norm is 1) and 
therefore (y\B\y) = V\y) € AH. This implies that BA\v) = V\v) G H and therefore BA = 0. Simila rly, AB = 0. 
We need to show now that whenever \\A + B||oo = 1 the maximum value of the left hand side of (AlOl is 1/4. One 
can check this by wri ting |x) = o.\xa) + P\xb) where \xa) G AH and \xb) G BH, with (xa\xb) — 0. The product on 
the left hans side of (AlOl becomes |a| 2 |/3| 2 , with maximum value 1/4. The right hand side is trivially equal to 1/4, 



and the Theorem is therefore proved for this case. 

The last case that is left to prove is when AH f] BH = {0} and ||j4 + B\ 
which 



and let 



(A + B)\y) = (\\A + B\U\y}, 



B\y) 

\B\y)\\ 















|o> = 


\\A\y)\V |6) 


that both 
that 


\\A\y)\\ 


and 


\B\y)\\ 


are 


non 


-zero 


since || A + B 



> 1. We define \y) to be the vector for 

(All) 

(A12) 



\\A + B\\ 00 = \\\a){a\ + 
Indeed, since \a){a\y) = A\y) and \b)(b\y) — B\y) we get 

(|o)(o| + l&X&Dly) = (A + B)\y) ={\\A+ B\U\y), 



> 1, so A\y) ^0 and similarly B\y) ^ 0. We next 



(A13) 



(A14) 



which implies 



On the other hand, 



|o)(o| + |6)(6|||oo>||^ + B| 



(A15) 



|a)(a| + |&)(6||| 00 SC||A + .B|| 00 , 



(A16) 



since A ^ [ g){g| and B ^ \b)(b\ (see the definition of \a) and \b) in ( |A12[ )). Combining (A15) and (A16) yields the 
equality in (A13). 

Since A ^ \a)(a\ and B ^ |&)(&|, we also have Tr(pA)Tr(pB) ^ (a\p\a)(b\p\b), and taking the maximum over all p 
yields 



maxTr(pA)Tr(pB) > max(a\p\a)(b\p\b) = -\\\a)(a\ 
p p 4 



^ = -|L4 + J B||4„ 



(A17) 



where the first equality follows from Lemma [T] and the second equality from (A13|. Combining (A9) and (A17) 
completes the proof of Theorem [T] □ 



1. Exact expressions for f^ and 0,2 

We now have all ingredients for computing f^ and fi 2 - Note that 

Qi = maxmaxp m (p)g„(p) = maxmaxTr(/9|a m )(a m |)Tr(p|6„)(6 n |) 

m,n p m,n p 

= max- |||a m ){a m | + |&n)(^n||| 00 = max- [1 + |(a m |6„)|] - '"' - '" 

m, n 4 7?i, n 4 



-ll + cf 



(A18) 



where the first equality on the second line follows from Theorem [T] and the second equality on the second line follows 
from the arguments in the last part of the proof of Theorem [T] 
To compute f2 2j first observe that, for fixed m and k, 



max 

p 



max 

p 



p m (p)^2q n (p) 

71=1 

a m )(a m \ + ^2 \b„)(b 



Tr( / o|a m )(a m |)Tr p^ \b n ){b n 



\ ^\{a m \b n 

\ n=l 



(A19) 



where the third equality follows from Theorem [T] and the last one from the arguments in the last part of the proof of 
Theorem [Tl By symmetry, the above equations hold under interchanging p and q. Next note that 



p m q n +Pm>q n > ^ (q n + g„') max {Pm,Pm'} 



(A20) 



and also 



p m q n +p m >q n ' < (p m + Pm>) max{q n , q n >}, 



(A21) 



where, for simplicity of notation, we dropped the p dependence. Combining (A19), (A20) and (A21| yields the 
expression ([9]) (main manuscript) for f^- 



2. The Upper Bounds Cl k 



Theorem nl can also be used to prove the exact expressions introduced in Eq. (11 1 for the upper bounds defined in 
Eq. (10). In particular, Eq.(ll) follows from Eq.(10l by substituting 



A= ^2 \ a m)(a m \ and B = ^ \bn)(b n \ 
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FIG. 4: Upper bound fi^ for n^ from a Young-like diagram. Color online. 



in Theorem [T] 

The upper bounds 51^. in ( 10 1 can be easily visualized using a Young-like diagram. Each box in the figure represents 
a component p m q n of the probability vector p <£> q, arranged in lexicographical order (we assume for simplicity that 
Pi ^ Pi ^ ' ' ' ^ Pd and <h ^ 52 ^ ' ' ■ ^ to)- For a fixed p, the upper bound ilk in ( 10 ) (main manuscript) corresponds 
to taking the maximum among the sum of the components of each 1 x D , 2 x [D — 1) , . . . , D x 1 rectangles in 
the figure, for a total of D such rectangles. We highlight 3 possible rectangles using different colours : 1 x D with 
light blue, 3 x (D — 2) with light green and D x 1 with light red. It is easy to see that the optimal flk in Q (main 
manuscript) is upper bounded by Oj., as the former is always constructed as a partial sum over some rectangle in the 
figure. 



It also follows at once that u> in (12), constructed with the upper bounds fi/., majorizes the optimal u> op in (|8l) , i.e. 



u) op -< u>, since the sum of the largest k components of u> op and u> are flk and $\., respectively, with fl^ $J il 



(§,i. 



3. An alternative upper bound 



(A22) 



We also found an alternative u/ that majorizes the optimal uj op , constructed as follows 

J = (wi,w 2 ,-..,W2,w,0,...,0), 

where Wi = Oi, W2 = ^2 — ^lj and Q = 1 — uj\ — tu)2, with £ ^ 1 the greatest integer for which Q ^ 0, i.e. £ = ~" 1 . 
Here Wj appears £ times on the right hand side of ( A22[). From the discussion at the beginning of the Appendix, 



where we showed that the optimal uj op has ordered components, it follows at once that u> op ~< u>' . However, numerical 
simulations indicate that the vector u> discussed above (which is defined in terms of £lk) induces, most of the time 
(around 80% of the cases), s trong er uncertainty relations in comparison with the us' defined here. Nevertheless, the 
simplicity of the expression ( |A22 ) makes it very useful. We also note that a/ and u> are not comparable in general 
under majorization; i.e. in general we have lj -ft, a/ and u/ -fc u>. 



Appendix B: L ^ 2 POVMs 



In the last part of the paper we extended the results valid for two orthonormal bases to the most general case of 
L Js 2 POVMs, and derived the uncertainty relation (13), where w op given in (14) with components defined in ( fl5| ) 
is by construction the optimal one. As was the case before for two orthonormal bases, we do not have an explicit 
expression for the optimal u> op , however we use the upper bound (16) for Qk to construct another vector u> such that 



u> op -< u>. We now prove that indeed (16) is an upper bound on flk- Note that 



max max ^ pQ (p)pW ( p ) . . . p W ( p ) = m ax max ^ J| Tr [pll 

(ai,...,Qi)6l fc (ai,...,QL)6l t £=1 



< max max 

S-L,...,S L p 



£*=i \s t \=L+k-i 



max 

Si,...,Sz. 






nxr 

1=1 \a e eS e / 



^ max max Tr 

Si,—,Sl p \ 

Efci |S/|=i+fc-i x 



^ 1. 



^ £=1 Q f e5 f 



w 



(Bl) 



The first inequality is a simple consequence of the fact that Ik C <Si x • • • xSl. The second inequality follows from the 
geometric-arithmetic mean inequality, the third inequality follows from the properties of the infinity norm, and the 
last inequality follows from the fact that the operator inside the norm is smaller or equal to the identity. Therefore 
^k ** &k- Unlike the case of two orthonormal bases, where the first two upper bounds coincide with the optimal ones, 
ill = S^i and 512 = ^2, for the case of POVMs this is no longer true, and in general the upper bounds ttk are strict for 
all k. The reason behind this fact is that Theorem [I] does not admit a generalization to more than two measurements. 
In the following example we show that our L > 2 POVM uncertainty relation ( 13 1 is much stronger than the one 



obtained by summing pairwise two-measurements uncertainty relations, one for each pair of observables. 
Example 1. Consider the following three orthonormal bases in a four dimensional Hilbert space 

S 1 = m^,|2),|3)} 
B 2 = 



10} 



|2) + |3) 
V2 



6, 



|2) + |3) 
V2 



|1) + |2)-|3) 2|1)-|2) + |3) 

73 ' Vq 

|0) + |2) - |3) 2|0) - |2) + |3} 



73 



76 



(B2) 



Note that each pair of bases share a common eigenvector (boxed above), and the three common eigenvectors are all 
different: |0), J and |1). For this case, the upper bounds in (16) are il\ = 0.78 and flk — 1, for k > 1. Therefore 
u) = (0.78, 0.22, 0, . . . , 0), and our uncertainty relation ( 13 ) is non-trivial, whereas a sum of pairwise two-measurements 



uncertainty relation provides a trivial lower bound of zero (as each pair of basis share a common eigenvector). 



Appendix C: Weighted uncertainty relations 



Entropic uncertainty relations of the form 



5>F(pW(p)) > C. 



(CI) 



were also considered in the literature (see [B] and the references within). Here H is the Shannon or Renyi entropy, 
Y]e—i tt = 1) t£ > is a probability associated with the measurement £, and C is some positive constant. This is a 
generalization of the original entropic uncertainty relation pi) to the case in which there are L measurements not all 
with the same weight. In this section we incorporate this unevenness in the weights of the distinct measurements into 
our UURs formalism, and show that the UUR given in Eq. |i"3| ) of the main text also produces entropic uncertainty 
relations of the form (CI ). 

Indeed, consider L integers {we}e=i,...,L and set W = J2e=i w ?- Consider also W measurements (not all distinct) 
such that for each £, wg of them are identical. Thus, in this case of such W measurements, Eq. (IT3| takes the form: 



P {e) (p) 



■< w, 



op- 



Vp. 



(C2) 



Now, by applying an additive measure of uncertainty $ on ( C2 ) we get 



L 



Wl& 



pW(p) >*M, 



(C3) 



or, equivalently, 



5>*[p w (p)1 > w *(?>) 



(C4) 



The quantities tg = wg/W form a probability distribution and the uncertainty relation (C4) can thus be seen as 
a "weighted" one. Thus, the vector uncertainty relation (C2) is a universal version of the entropic uncertainty 
relation (CI ). Note, however, that the UUR given in (C2) is not a new one but simply follows from our main result 



in Eq. (13) when applied to the special case where not all of the measurements are distinct. 



